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INSECTICIDAL PEPTIDES 

This invention relates to insecticidal peptides and DNA sequences encoding them, 
processes for their manufacture and use, and transgenic plants transformed with constructs 
5 encoding said peptides. In particular the invention relates to insecticidal peptides isolatable 
from the genus Paecilomyces. 

Many fungi are pathogenic to insects. It is known that Peacilomyces fumosoroseus 
can be used as a biological control agents and this strain is sold commercially as a biocontrol 
agent for use in greenhouses. No work has, however, been carried out looking for gene- 
10 encoded insecticidal peptides from genus Paecilomyces. 

The applicants have found that insecticidal peptides can be extracted from of 
Paecilomyces spp and have now purified a new type of potent orally active insecticidal 
peptide from strains of Paecilomyces spp. in particular Paecilomyces farinosus. 

The present invention provides a peptide obtainable from the genus of Paecilomyces, 
15 and having insecticidal properties or a fragment thereof, or a homologue, variant or 

derivative of any of these which has insecticidal activity. Suitably the peptide is isolated in 
the sense that it is substantially free from other proteins and peptides with which is 
associated in Paecilomyces, Specifically, the protein in substantially pure form. 

The term "peptide" used herein encompasses both small peptides and large 
20 polypeptides including proteins. 

Once isolated from Paecilomyces, an insecticidal peptide can be characterised using 
conventional methods, and then fragments, variants of homologues having insecticidal 
activity can be derived. Suitably the peptide is an insecticidal peptide obtainable from the 
genus Paecilomyces, such as Paecilomyces farinosus. 
25 In particular, the invention provides an insecticidal peptide comprising the amino acid 

sequence 

GKICTPAGVKCPAALPCCPGLRCIGGVNNKVCR (SEQ ID NO 1) 

30 or a fragment thereof, or a homologue, variant or derivative of any of these which has 
insecticidal activity. 
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SEQ ID NO 1 is a peptide derivable from Paecilomyces farinosus. 

As used herein the expression "fragment" refers to any portion of the given amino 
acid sequence which has insecticidal activity either alone or when combined with other 
portions of the amino acid sequence. Fragments will suitably comprise at least 5, and 
5 preferably at least 10 consecutive amino acids from the basic sequence. 

The expression "homologues" as used herein refers to any peptide which has some 
amino acids in common with the given sequence. Suitably at least 60% of the amino acids 
will be similar, more suitably at least 70%, preferably at least 80%, more preferably at least 
90% and most preferably at least 95% of amino acids will be similar to the corresponding 
10 amino acid in the given sequence. 

As used herein the term "similar" is used to denote sequences which when aligned 
have similar (identical or conservatively replaced) amino acids in like positions or regions, 
where identical or conservatively replaced amino acids are those which do not alter the 
activity or function of the protein as compared to the starting protein. For example, two 
15 amino acid sequences with at least 85% similarity to each other have at least 85% similar 
(identical or conservatively replaced) amino acid residues in a like position when aligned 
optimally allowing for up to 3 gaps, with the proviso that in respect of the gaps a total of not 
more than 15 amino acid resides is affected. The degree of similarity may be determined 
using methods well known in the art (see, for example, Wilbur, W.J. and Lipman, D J. 
20 "Rapid Similarity Searches of Nucleic Acid and Protein Data Banks." Proceedings of the 
National Academy of Sciences USA 80, 726-730 (1983) and Myers E.and Miller W. 
"Optimal Alignments in Linear Space". Comput Appl. Biosci. 4: 1 1-17(1988)). One 
programme which may be used in determining the degree of similarity is the MegAlign 
Lipman-Pearson one pair method (using default parameters) which can be obtained from 
25 DNAstar Inc, 1228, Selfpark Street, Madison, Wisconsin, 53715, USA as part of the 
Lasergene system. 

Amino acids which differ from the basic sequence may be conservatively or non- 
conservatively substituted. A conservative substitution is to be understood to mean that the 
amino acid is replaced with an amino acid with broadly similar chemical properties. In 
30 particular conservative substitutions may be made between amino acids with the following 
groups: 



PED50431 

-3- 



(i) Alanine, Serine, Glycine and Threonine; 

(ii) Glutamic acid and Aspartic acid; 

(iii) Arginine and Lysine; 

(iv) Asparagine and Glutamine; 

5 (v) Isoleucine, Leucine, Valine and Methionine; 

(vi) Phenylalanine, Tyrosine and Tryptophan, 

In general, more conservative than non-conservative substitutions will be possible 
without destroying the insecticidal properties of the compounds. Suitable homologues may 
be determined by testing insecticidal properties of the peptide using routine methods, for 

10 example as illustrated hereinafter. 

The term 'Variant" as used herein includes experimentally generated variants or 
members of a family of related naturally-occurring peptides as may be identified by 
molecular genetic techniques. Such techniques are described for example in US Patent No. 
5,605,793, US Patent No. 5,81 1,238 and US Patent No 5,830,721, the content of which is 

15 incorporated herein by reference. In essence this technique involves expression of the 
parental gene in a microbial expression system such as Escherichia coli. The particular 
system selected must be validated and calibrated to ensure that biologically active peptides 
are expressed, which may be readily achieved using a in vivo bioassay. The gene, or 
preferably a collection of related genes from different species, may be subject to mutagenic 

20 polymerase chain reaction (PCR) as is known in the art. Fragmentation of the products and 
subsequent repair using PCR leads to a series of chimeric genes reconstructed from parental 
variants. These chimeras are then expressed in the microbial system which can be screened 
in the usual way to determine active mutants, which may then be isolated and sequenced. 
Reiteration of this molecular evolution DNA shuffling cycle may lead to progressive 

25 enhancement of the desired gene properties. The advantage of a technique of this nature is 
that it allows a wide range of different mutations, including multi-mutation block exchanges, 
to be produced and screened. 

Particular variants are those derivable from peptides of SEQ ID NO 1 or homologues 
thereof, in particular those isolable from Paecilomyces spp. . Other particular variants are 

30 those which are experimentally generated using for example the molecular evolution 

techniques. Preferably such variants will have improved insecticidal activity or function as 
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compared to the native sequences. Suitable improvements may be in relation to the intrinsic 
specific activity of the protein, the specificity or target range against which the peptide is 
active or by altering a physical property such as stability. 

In a further aspect, the invention provides the use of a peptide or peptides 
5 comprising SEQ ID NO 1, in the production of other insecticidal variants using molecular 
evolution and/or DNA shuffling methods. 

Other variants may be identified or defined using bioinformatics systems. An 
example of such a system is the FASTA method of W.R. Pearson and DJ. Lipman PNAS 
(1988) 85:2444-2488. This method provides a rapid and easy method for comparing protein 

10 sequences and detecting levels of similarity and is a standard tool, used by molecular 

biologists. Such similar sequences may be obtained from natural sources, through molecular 
evolution or by synthetic methods and comparisons made using this method to arrive at "opt 
scores" which are indicative of the level of similarity between the proteins. 

Particular variants of the invention will comprise insecticidal peptides with an amino 

15 acid sequence with a FASTA opt score (as defined in accordance with FASTA version 3.0t82 
November 1, 1997) against SEQ ID NO 1 of greater than 109, for example in excess of 120, 
more preferably in excess of 1 50 and most preferably in excess of 190. 

Variants which give FASTA scores in excess of 199 when compared with SEQ ID 
NO 1 are particularly preferred. 

20 With these constraints in mind, a skilled person would be able to isolate other 

members of the family of peptides, for example by designing probes or primers based upon 
SEQ ID NO 1 but modified within the limits of the FASTA opt score range. These probes 
could then be used to screen libraries such as cDNA or genomic libraries using conventional 
methods, in particular those derived from fungal species, in order to isolate other family 

25 members. Hybridisation conditions used during these screening exercises are either low or 
high stringency, preferably high stringency conditions as are routinely used in the art (see for 
example "Molecular Cloning, A Laboratory Manual" by Sanbrook et al, Cold Spring Harbor 
Laboratory Press, N. Y. ). In general terms, low stringency conditions can be defined as 3 x 
SCC at about ambient temperature to about 65°C, and high stringency conditions as 0.1 x 

30 SSC at about 65°C. SSC is the name of a buffer of 0.15M NaCl, 0.015M trisodium citrate. 3 
x SSC is three times as.strong as lx SSC and so on. 
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Once found other family members could also be subject to molecular evolution 
techniques or DNA shuffling as described herein, in order to improve the properties thereof. 
All peptides obtained in this way should be regarded as a variant within the ambit of the 
present invention. 

5 The term "derivative" relates to peptides which have been modified for example by 

using known chemical or biological methods. This includes acetylated or glycosylated forms 
of the peptides. It is believed that in particular the glycine at position 1 of the naturally 
occuring version of SEQ ID NO 1 is acetylated. 

In particular, in this case, the insecticidal peptides of SEQ ID NO 1 contains 

10 6 cysteine residues all of which are believed to be involved in forming 3 intramolecular 
disulphide bonds. Thus the arrangement of the cysteine residues may be important in 
conferring insecticidal activity on the peptide. Therefore, homologues or variants suitably 
retain the cysteine residues of the combination of SEQ ID NO 1 . 

Thus the invention encompasses peptides which may be represented as 

15 -AA 1 -AA 2 -AA 3 -Cys-AA 4 -AA 5 -AA 6 -A^ 

Cys-AA 15 -AAi 6 -AA I7 -AAi S -Cys-AA 19 -AA^ 

or a fragment thereof, wherein AA, -AA 27 refer to any amino acid other than cysteine. In 
this formula, at least some and preferably a substantial portion of AA, to AA 27 will be the 
same as in the corresponding amino acids in SEQ ID NO 1 above. In particular, it would be 

20 expected that at least 85% of the amino acids corresponding to AA! to AA 33 of SEQ ID NO 
1, are either identical or conservatively substituted as defined above. 

Peptides of the invention may be used alone or they may be fused to other peptides or 
proteins so as to form chimeric peptides or proteins. Suitably, the other peptide or protein of 
the chimera will have some insecticidal effect of its own, or will act as a targeting sequence 

25 to target the insecticidal peptide to a particular site in the target pest . 

The above described peptides may be prepared in various ways. For example, they 
may be extracted and purified from Paecilomyces isolates. However, since the peptide 
sequence is known, it may be more convenient to manufacture the peptides, either by 
chemical synthesis using a standard peptide synthesiser, or using recombinant DNA 

30 technology to express the peptide or variants of that peptide in suitable host cells. Suitable 
cells include prokaryotic or eukaryotic organisms, in particular micro-organisms such as E. 
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coli, Saccharomyces cerevisiae or Pichia pastoris. Nucleic acids encoding the peptides, as 
well as vectors, host cells and methods of producing the peptides form further aspects of the 
invention. 

In particular, the invention further provides a nucleic acid which encodes an 
5 insecticidal peptide as described above. The nucleic acid sequence may be a DNA or RNA 
sequence. In particular the DNA may be a cDNA sequence or a genomic sequence, and may 
be derived from a cDNA clone, a genomic DNA clone or DNA manufactured using a 
standard nucleic acid synthesiser. A particular nucleic acid sequence which encodes a 
peptide of SEQ ID NO 1 is the nucleic acid of SEQ ID NO 9 as shown in Figure 3. 

10 The DNA sequence may be predicted from the known amino acid sequence and DNA 

encoding the peptide may be manufactured using a standard nucleic acid synthesiser. 
Alternatively, the DNA sequence may be isolated from fungal-derived DNA libraries. 
Suitable oligonucleotide probes may be derived from the known amino acid sequence and 
used to screen a cDNA library for cDNA clones encoding some or all of the peptide. 

15 The sequence of the gene encoding the protein of SEQ ID NO 1 has been obtained 

(SEQ ID NO 2) and is shown in Figure 2 hereinafter. The sequence was determined using 
reverse transcription of RNA from these fungi and Rapid Amplification of cDNA Ends 
(RACE) by Polymerase Chain Reaction (PCR). The 3' region of the cDNA was isolated 
first, using the N-terminal amino acid sequence of the mature peptide for degenerate primer 

20 design. The specific sequence obtained from the 3 'RACE fragment was then used to design 
specific primers for amplification of the 5 ' region of the corresponding cDNA. Similar 
techniques could be employed for any other members of the same gene family. A schematic 
diagram of the gene sequence is shown in Figure I . As well as a signal sequence of 1 7 
amino acids, and a coding sequence for the peptide of SEQ ID NO 1 followed by a TAA stop 

25 codon, the gene has a 5' untranslated region (5'UTR) of approximately 110 base pairs and a 
3'untranslated region (3'UTR) of approximately 160 base pairs. 

Using the natural coding sequence SEQ ID NO 2 or fragments thereof such as SEQ 
ID NO 9, naturally occurring variants can be isolated. These will include DNA which 
hybridizes to the naturally occurring sequence or fragments thereof. Preferably, such 

30 hybridization occurs at, or between, low and high stringency conditions, which have been 
defined above. 
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Particular homologues of SEQ ID NO 1 are homologies identified in other 
insecticidal fungi either by protein purification and sequence analysis or by amplification 
from fungal genomes and/or cDNA preparations using PCR primers based on SEQ ID NO 1 . 

The peptide of SEQ ID NO 1 is a secreted peptide and therefore DNA encoding it 
5 which is isolated from Paecilomyces spp contains a signal sequence in addition to the DNA 
sequence encoding the mature peptide. Figure 2 illustrates the signal sequence in bold type, 
whilst and the sequence of the mature peptide is underlined. The signal sequences (SEQ ID 
NOS 7 and 8 below) form a further aspect of the invention. 

Oligonucleotide probes or cDNA clones may be used to isolate the gene or genes 
10 which encode the insecticidal peptide by screening genomic DNA libraries. 

The DNA sequence encoding the insecticidal peptide may be incorporated into a 
DNA construct or vector in combination with suitable regulatory sequences (promoter, 
terminator, etc). Genes encoding the peptides of the invention can be expressed in a variety 
of systems, including monocotyledenous plants, dicotyledenous plants and microbial 
1 5 systems as would be known in the art. The components of the constructs such as the 

promoter, terminator, selectable marker can be chosen to ensure that the peptide is expressed 
well in the particular expression system. Examples are illustrated hereinafter. For example, 
the DNA sequence may be placed under the control of a constitutive or an inducible 
promoter (stimulated by, for example, environmental conditions, presence of a pest, presence 
20 of a chemical). Such a DNA construct may be cloned or transformed into a biological 

system which allows expression of the encoded peptide. Suitable biological systems include 
micro-organisms (for example, bacteria such as Escherichia coli, Pseudomonas and 
endophytes such as Clavibacter xyli subsp. cynodontis (Cxc); yeasts such as Saccharomyces 
cerevisiae and Pichia pastoris; viruses; bacteriophages; etc), cultured cells (such as insect 
25 cells, mammalian cells) and plants. The expressed peptide may be isolated and if necessary 
formulated, for use. Alternatively, the peptide may be expressed in situ or in vivo under 
circumstances where they will be directly brought into contact with the target pests. 

It has been found that the peptides of the invention are insecticidal whether applied to 
pests either orally or by injection. Pests affected in this way include lepidopteran pests, for 
30 example as illustrated hereinafter and dipteran species such as the fruit fly Drosophila 

melanogaster. Thus the invention further provides a method of killing or controlling insect 
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pests which comprises administering to said pests or to the environment thereof, a peptide as 
described above. 

For agricultural applications, the insecticidal peptide may be used to improve the pest 
insect-resistance or pest insect-tolerance of crops either during the life of the plant or for 
5 post-harvest crop protection. Pests exposed to the peptides are inhibited. The insecticidal 
peptide may eradicate a pest already established on the plant or may protect the plant from 
future pest attack. 

Exposure of an insect pest to an insecticidal peptide of the invention may be achieved 
in various ways, for example: 
10 (a) a composition comprising peptide may be applied to the insect or to the 

environment in which they live, in particular, to plant parts or the surrounding soil, using 
standard agricultural techniques (such as spraying); 

(b) a composition comprising a micro-organism such as an insect virus, 
genetically modified to express the insecticidal peptide may be applied to a plant or the soil 

15 in which a plant grows; 

(c) an endophyte genetically modified to express the insecticidal peptide may be 
introduced into the plant tissue (for example, via a seed treatment process); 

[An endophyte is defined as a micro-organism having the ability to enter into 
non-pathogenic endosymbiotic relationships with a plant host. A method of 

20 endophyte-enhanced protection of plants has been described in a series of patent applications 
by Crop Genetics International Corporation (for example, International Application 
Publication Number WO90/13224, European Patent Publication Number EP-125468-B1, 
International Application Publication Number WO91/10363, International Application 
Publication Number WO87/03303). The endophyte may be genetically modified to produce 

25 agricultural chemicals. International Patent Application Publication Number W094/1 6076 
(ZENECA Limited) describes the use of endophytes which have been genetically modified to 
express a plant-derived insecticidal peptide]. 

(d) DNA encoding an insecticidal peptide may be introduced into the plant 
genome so that the peptide is expressed within the plant body (the DNA may be cDNA, 

30 genomic DNA or DNA manufactured using a standard nucleic acid synthesiser). 
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Where method (a) or (b) above is used, the peptide or microorganism is generally 
applied in the form of an insecticidal composition. Such compositions, which form a further 
aspect of the invention, will generally further comprise an agriculturally acceptable carrier or 
diluent as is known in the art. Suitable carriers or diluents are solids or liquids. Concentrates 
5 in the form of solids or liquids may be prepared, which require dilution in water prior to 
application, for example by spraying. 

Preferably, the peptides of the invention are administered in accordance with method 
(d) above. Thus in a preferred embodiment, nucleic acids of the invention utilize codons 
which are particularly preferred in plants. 
10 Examples of preferred codon usage from cotton and maize plants is set out in Table 1 

Table 1 



Amino Acid 


emotion preicrence 


JYiaiZc preierence 


Alanine 


GCT 


GCC 


Argmme 


AGG 


AGG 


Asparagine 


AAC 


ACC 


Aspartic Acid 


GAT 


GAC 


Cysteine 


TGC 


TGC 


Glutamine 


CAA 


CAG , 


Glutamic Acid 


GAG 


GAG 


Glycine 


GGT 


GGC 


Histidine 


CAT 


CAC 


Isoleucine 


ATT 


ATC 


Leucine 


CTT 


CTG 


Lysine 


AAG 


AAG 


Methionine 


ATG 


ATG 


Phenylalanine 


TTC 


TTC 


Proline 


CCT 


CCG 


Serine 


TCT 


AGC 


Threonine 


ACT 


ACC 


Tryptophan 


TGG 


TGG 




60431 



-10- 



Tyrosine 



TAC 



TAC 



Valine 



GTT 



GTG 



The table shows that the codon usage for these two species is fairly similar, and even those 
preferred codons which differ are fairly compatible (e.g. they are the second most preferred 
codon). Thus a codon optimized DNA coding sequence for maize and cotton in respect of 
5 SEQ ID NO 1 can be readily generated. 

Plant cells may be transformed with recombinant DNA constructs according to a 
variety of known methods (Agrobacierium Ti plasmids, electroporation, microinjection, 
microprojectile gun, etc). The transformed cells may then in suitable cases be regenerated 
into whole plants in which the new nuclear material is stably incorporated into the genome. 

10 Both transformed monocotyledonous and dicotyledonous plants may be obtained in this way, 
although the latter are usually more easy to regenerate. Some of the progeny of these 
primary transformants will inherit the recombinant DNA encoding the insecticidal peptide(s). 

Thus the invention further provides a plant containing recombinant DNA which 
expresses an insecticidal peptide according to the invention. Such a plant may be used as a 

15 parent in standard plant breeding crosses to develop hybrids and lines having improved 
insect resistance. 

Suitably the recombinant DNA is incorporated such that it is expressed in a region of 
the plant which is subject to pest attack (such as the leaves) and is therefore ingested by the 
pest The DNA may comprise sequences which enhance or control this or which are 

20 necessary for the mature peptide to fold correctly. For example, the nucleotide sequence 
encoding the peptide may be under the control of a promoter which is expressed particularly 
in the desired tissues. Other methods of targeting the peptide are possible. For example, 
the nucleic acid may further comprise a signal sequence which targets the peptide to the 
apoplast (extra-cellular space) as a general expression location in the plant Suitable signal 

25 sequences include those derived for example from the Dahlia antifungal peptide Dm- AMP- 1 
and the Radish antifungal peptide Rs-AFPl, the Maize hydroxyproline-rich glycoproten 
(HRGP) signal peptide and the Tobacco PR-la signal sequence which are as follows: 
Dahlia: 

SEQ ID NO 4: -Met Val Asn Arg Ser Val Ala Phe Ser Ala Phe Val 
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SEQ ID NO 3:- ATG GTT AAT AG A TCT GTT GCT TTT TCT GCT TTT GTT 
Leu He Leu Phe Val Leu Ala lie Ser Asp He Ala 
CTT ATT CTT TTT GTT TTG GCT ATT TCA GAT ATT GCT 
Ser Val Ser Gly 
5 TCT GTT TCA GGA 

Radish: 

SEQIDN0 6:-Met Ala Lys Phe Ala Ser He lie Ala Leu Leu Phe 
SEQ ID NO 5:-ATG GCT AAG TTT GCT TCT ATT ATT GCT CTT TTG TTT 

Ala Ala Leu Val Leu Phe Ala Ala Phe Glu Ala Pro 
10 GCT GCA CTT GTT TTG TTT GCT GCA TTT GAA GCT CCA 

Thr Met Val Glu Ala 

ACT ATG GTT GAA GCT 
Maize HR.GP signal sequence: 

SEQIDN011:-Met Gly Gly Ser Gly Lys Ala Ala Leu Leu Leu Ala Leu 
15 SEQ ID NO 10:-ATG GGT GGC AGC GGC AGG GCT GCT CTG CTG CTG GCC CTG 
Val Val Val Ala Val Ser Leu Ala Val Glu lie Gin Ala 
GTG GTG GTG GCC GTG AGC CTG GCC GTG GAG ATC CAG GCC 

Tobacco PR- 1 a signal sequence 
20 SEQ ID NO 13:- Met Gly Phe Val Leu Phe Ser Gin Leu Pro Ser Phe Leu 
SEQ ID NO 12 ATG GGA TTT GTT CTC TTT TCA CAA TTG CCT TCA TTT CTT 

Leu Val Ser Thr Leu Leu Leu Phe Leu Val He Ser His 
CTT GTC TCT ACA CTT CTC TTA TTC CTA GTA ATA TCC CAC 

25 

Ser Cys Arg Ala 

TCT TGC CGT GCC , 



30 



A further suitable signal sequence is the signal sequence of the gene encoding SEQ ID NO 1 
from Paecilomyces farinosus which can be represented as follows: 



IP 
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SEQIDN0 8:-Met Gin lie Ser Ala Val He Val Ala Leu Phe Ala 
SEQ ID NO 7:-ATG CAA ATC TCC GCC GTC ATT GTC GCA CTC TTC GCC 

5 Ser Ala Ala Met Ala 

AGC GCC GCC ATG GCC 

These or other signal sequences can be used as pre-protein signals and the sequence 

10 encoding the insecticidal peptide of the invention may then be placed at the C-terminal end 
of the chimeric protein. 

Transgenic plants in accordance with the invention show improved resistance or 
enhanced tolerance to an insect pest when compared to a wild-type plant. Resistance may 
vary from a slight increase in tolerance to the effects of the pest (where the pest is partially 

15 inhibited) to total resistance so that the plant is unaffected by the presence of pest (where the 
pest is severely inhibited or killed). An increased level of resistance against a particular pest 
or resistance against a wider spectrum of pests may both constitute an improvement in 
resistance. Transgenic plants (or plants derived therefrom) showing improved resistance are 
selected following plant transformation or subsequent crossing. 

20 Examples of genetically modified plants which may be produced include field crops, 

cereals, fruits and vegetables such as canola, sunflower, tobacco, barley, sorghum, tomato, 
mango, peach, apple, pear, strawberry, banana, melon, potato, carrot, lettuce, cabbage, onion, 
etc. Particularly preferred genetically modified plants are sugar beet, cotton, maize, wheat, 
rice, soya spp, sugar cane, pea, field beans, poplar, grape, citrus, alfalfa, rye, oats, turf and 

25 forage grasses, flax and oilseed rape, and nut producing plants. 

As the insecticidal peptides of the invention are very active against some of the major 
cotton pests, it would be particularly advantageous to transform cotton plants with constructs 
encoding said peptides. Alternatively, the peptides may be supplied to cotton plants by any 
other suitable method. 

30 The invention still further includes the progeny of the plants of the preceding 

paragraph, which progeny comprises the said polynucleotide, or functionally sufficient parts 
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thereof, stably incorporated into its genome and heritable in a Mendelian manner and the 
seeds of such plants and such progeny. 

Plant transformation, selection and regeneration techniques, which may require 
routine modification in respect of a particular plant species, are well known to the skilled 



The insecticidal peptides of the invention may be employed alone or in combination 
with other agrochemicals such as herbicides, fungicides or, most suitably, other insecticidal 
compounds such as insecticidal peptides and proteins. Thus insecticidal compositions in 
accordance with the invention may comprise additional agrochemical compounds. Where 
10 the other compounds are peptides or proteins, nucleic acids encoding these may be included 
in the composition in the form of expression vectors. Where these are used, the additional 
nucleic acids may be in the same vector as the peptide of the invention, or in additional 
vectors. 



15 protease inhibitors and insectidal proteins derived from species of the Bacillus thurigiensis , 
Xenorhadus nematophilus, or Photorabdus luminescens. 

The invention will now be described by way of example only with reference to the 
drawings, in which: 

Figure 1 is a schematic diagram of the organisation of the Paecilomyces farinosus gene; 
20 Figure 2 shows the nucleotide sequence of the natural gene encoding SEQ ID NO 1 as well 
as the amino acid sequence, with the signal sequence shown in bold type and the sequence of 
the mature peptide underlined; 

Figure 3 shows SEQ ID NO 9 which is the natural coding sequence of the mature peptide of 
SEQ ID NO 1; and 

25 Figure 4 shows diagrammatically a construct suitable for the transformation of corn where A 
illustrates the signal-gene fusion part of the construct and B illustrates a backbone vector. 

Example 1 

Culturing of Paecilomyces farinosus 
30 Paecilomyces farinosus was routinely cultured on potato dextrose agar plates. Spores were 
harvested from the plates by adding sterile water and scraping with a sterile spatula. For 



5 



man. 



Examples of possible mixture partners include insecticidal lectins, insecticidal 
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production of insecticidal peptide 6xl0 7 spores were inoculated into 5x 200ml of SDB 
medium in 500ml flasks. Cultures were incubated at 24°C with shaking at 180rpm for 7 days 
before harvest. 

5 Example 2 

Purification of insecticidal peptide 

500ml of 7d culture filtrate was filtered through Whatman GF/B paper to remove mycelium 
and the supernatant diluted 4 fold in 20mM MES pH6. The supernatant was then loaded 
onto a S-Sepharose FF XK16/10 column (Pharmacia Biotech) previously equilibrated with 

10 20mM MES pH6. Unbound protein was washed through the column with 3 column volumes 
of 20mM MES pH6 and bound protein was eluted with a linear gradient of 0-1M NaCl in 
20mM MES pH6 ovr 20 column volumes. The eluate was monitored for peptide by online 
measurement of absorbance at 280 and 210nm. 5ml fractions were collected and following 
dialysis against 50mM Sodium Phosphate buffer pH7 assayed against Heliothis virescens. 

15 Active fractions eluted around 250mM NaCl . These fractions were pooled and 

following concentration on Polyethylene glycol Mwt 20,000, were further purified by reverse 
phase. 2ml of sample was loaded onto a 3ml Resource RPC column (Pharmacia Biotech) 
and bound peptide eluted with a linear gradient of 0.05% trifluoroacetic acid (TFA) to 50% 
acetonitrile, 0.0% TFA over 20 column volumes. The eluate was monitored for absorbance 

20 at 21 0 and 280nm. The active peak eluted at approximately 20% acetonitrile. 

Example 3 

Identification of peptide sequence 

The sequence of the active peptide in the product of Example 2 could not be determined 
25 directly, probably due to a blocked N-terminus. The peptide was reduced and subjected to 
and tryptic digestion. This yielded a series of fragments which could be sequenced using 
Edraan degradation methods. Using a combination of this, and mass spectrometry, the 
sequence of the peptide was determined as being SEQ ID NO 1 . The mass spectrometry data 
indicates that the N~terminal glycine is acetylated. 

30 

Example 4 
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Biological Activity in insect bioassav 

The isolated peptide was bioassayed against a range of insect species using the 
following method: 

Prior to the assay twenty neonate lepidoptera larvae were gently brushed into each of 
5 three 'minipots' containers per treatment (i.e. three replicates per treatment). The peptide 
from Example 2 was diluted using 0.1% Synperonic™ solution to act as a wetter and aid the 
spread of the material over the waxy leaf cuticle. In spectrum assays, test materials were 
made up to a single high concentration, whereas in potency assays vs. K virescens a rate 
range was tested. 

10 Three freshly excised cotton leaves per treatment had 0.1 ml of the appropriate 

treatment applied by pipette to the centre of the axial surface of each leaf. The droplet was 
then spread over a circular area in excess of the diameter of a minipot with a fine artists paint 
brush (a fresh paint brush being used for each compound to avoid contamination). The 
leaves were left in a fume cupboard just long enough for the surface deposit to dry but care 

15 was taken to avoid excessive leaf wilting. 

Once dry the leaves were placed, contaminated surface down over the appropriately 
labelled minipot and a lid snapped over it. The minipots were placed in plastic trays and 
held in a controlled temperature at 25-27°C. 

After three days the numbers of live larvae remaining were counted and percent 
20 mortality determined. In the K virescens potency assay the test data was run through a logit 
analysis package to establish the LC 30 . 

The results for 4 lepidopteran pests are shown in Table 2. 

Table 2 



Test Species Rate fpp m) %kill 
Heliothis virescens 1000 100 
Helicoverpa zea 1000 100 
Spodoptera exigua 1000 100 
Plutellaxylostella 1000 100 



25 b) Cell cytotoxicity 
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Two celi lines were used to determine if the peptide from Example 2 was cytotoxic to 
either mammalian cells (MEL cells) or insect cells (Sf21 cells), MEL cells and SGI cells 
were grown in DMEM and TCI 00 media respectively in 96-well microtitre plates and 
incubated with the appropriate concentration of peptide. The cells were scored for visible 
5 cell death after 24 hours and viability and growth assessed after 3 (MEL cells) or 4 (Sf21) 
days using the reduction of MTT (3-(4,5-Dimethylthiazol-2~yl>2,5-diphenyltetrazolium 
bromide) to form an insoluble purple formazan as a marker for metabolically active cells. 

At the highest rate tested (100 }ig/ml) the peptide did not inhibit cell growth or cause 
any cytotoxic effects on either cell line. 

10 

Example 5 

Comparison of known protein sequences to SEQ ID NO 1 using the FASTA Algorithm: 

A FASTA comparative search of SEQ ID NO 1 to a database of protein sequences 
was carried out 

15 SEQ ID NO 1 was compared to all publicly available protein sequences using the 

FASTA method (FASTA version 3.0t82 November 1, 1997 Reference: W.R. Pearson & D.J. 
Lipman PNAS (1988) 85:2444-2448). 

Specifically a large non-redundant protein database, including release 36.0 of 
SWISS-PROT, queried on 16 June 1999 returned proteins judged to have some similarity to 

20 SEQ ED NO 1 . The best way to judge similarity using FASTA by those skilled in the art is to 
use the opt score output The comparison of SEQ ID NO 1 to the non-redundant protein 
sequence database gave very few proteins and none with a high opt score demonstrating that 
SEQ ID NO 1 is not closely related to any known proteia The 'most similar 1 was in 
GENESEQP and had an opt score of 109. 

25 

Example 6 

Characterisation of natural coding sequence of peptide of SEP ID NO 1 
Harvestin g of Material 
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A Paecilomycesfarinosus strain having insecticidal activity was grown in Sabouraud 
Dextrose Broth (Difco Laboratories: lOg Bacto Neopeptone, 20g Bacto Dextrose per litre 
water) for 5 days at 24°C with shaking at 1 80 rpm. 

5 The culture was pelleted (8000rpm, 1 0 minutes) and stored at -80°C until use. 
RNA Extraction 

Harvested material was ground to a fine powder using a pestle and mortar under liquid 
nitrogen. RNA was extracted from 1 g of fungal pellet using the Qiagen RNeasy kit, 
10 following manufacturers specifications. The total RNA fraction was eluted from the RNeasy 
purification column in 1ml water. 

Poly(A)+ RNA was isolated from the 700mg total RNA using the Promega PolyATtract 
mRNA isolation system I, following manufacturers' specifications. The Poly(A)+ RNA 
15 fraction was eluted from the magnetic beads in 1ml water, and concentrated to 1 5\xm\ (of 
approximately 0.5mg/ml) by ethanol precipitation. 

RNA samples were stored at -80°C until use. 

20 In the following reactions, the primers and probes used are summarised in Table 3. 



Table 3 
Primer/Probe sequences 
Designation Primer Sequence SEQ ID NO 

Anchorl TTTTTTTn 1 1 1 TTTTTACGCCGGCGCTTAA 14 

GTACGCTCGGGCT 

Anchorl - Rl TCGGGCTCGCATGAATTCG 1 5 

Anchorl - R2 ATGAATTCGCGGCCGCAT 16 

Anchorl - R3 TCGGGCTCGC ATGAATTCGCG 1 7 

Anchorl - R4 CTCGCATGAATTCGCGGCCGC 1 8 

Fl ATHTGYCANCCNGCNGG 19 

F2 ATHTGYCANCCNGCNGGNGT 20 
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F3 CANCCNGCNGGNGTNAA 

F4 CCNTGYTGYCCNGGNYT 

F5 TNAARTGYATHGGNGG 

F6 GGNGTNAAYAAYAARGTNTG 

F7 AARATHTGYACICCIGCIGGIGTIAA 

F8 CCIGCIGGIGTIAARTGYCCIGCIGC 

F9 TGYCCIGCIGCIYTICCITGYTGYCC 

F10 TGYATHGGIGGIGTIAAYAAYAARGT 

F 1 1 TAAATGTCCCGCGGCTCTTCC 

F12 CGGCTCTTCCTTGCTGCCCCG 

F13 TGCTGCCCCGGACTTCGCTGC 

Anchor3 HO-GTTTA ATTACCC AAGTTTGAGNNNNN - 
NH 2 

Anchor3 - attach P0 4 - CTCAAACTTGGTAATTAAACC - NH 2 

Anchor3 - F 1 GGTTTAATTACCCAAGTT 

Anchor3 - F2 TAATTACCCAAGTTTGAG 

Anchor3 - F3 GGTTTAATTACCCAAGTTTGAG 

Rl CAIACYTTRTTRTTIACICCICC 

R2 ATGCAGCGAAGTCCGGGGCAG 

R3 GGGGCAGCAAGGAAGAGCCGC 

R4 AAGAGCCGCGGGACATTTAAC 

Probe F AGTTAAATGTCCCGCGGCTCTTCCTTGCTG 

CCCCGGACTTCGCTGCATC 

Probe R GATGCAGCGAAGTCCGGG 



where "P designates a forward primer and "R" designates a reverse primer 

I RACE PCR 
First strand cDNA synthesis 
The ClonTech Advantage RT-for-PCR kit was used for this, in accordance with the 
manufacturers' specifications. 
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20 prnoi oligo(d)T primer 4 Anchorl ' was annealed to 1 (ig total RNA in a total volume of 
13.5 fil by heating to 70°C for 2 minutes and rapidly quenching on ice. 

5 The following reaction components were added: 

4 5 x reaction buffer 
0.5 \xl RNase inhibitor 

1 |il MMLV reverse transcriptase 
l^llOmMdNTP 

10 

The reaction was incubated at 42°C for 1 hour, and the reaction stopped by denaturing the 
enzyme at 95°C for 5 minutes. 80 ^il RNase free water was added and cDNA stored at - 
80°C. 

15 3'RACEPCR 

5 \xl of reaction mix from the first strand cDNA synthesis reaction was used as a template 
with various primer set combinations to amplify the 3' end of the peptide coding cDNA. The 
primers (see Table 3) used were degenerate, and designed on the known amino acid 
sequence of the N-terminal end of the mature peptide to allow for selective amplification. 

20 

PCR reactions were performed using Ready-To-Go PCR Beads (Amersham Pharmacia 
Biotech). These contain all necessary components for a PCR reaction as a bead in a 0.5 ml 
tube. The following components were added: 

5 \xl reaction mix from cDNA synthesis step 
25 pmol 
25 pmol 

to a final volume of 25 \il . 
30 PCR cycle conditions 



25 cDNA template 
Forward primer 
Reverse primer 
Sterile Water 
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95 °C lmin 



95 °C 1 min 



5 



i 



64 °C* 1 min 



30 cycles 



72 °C lmin 



10 



72 °C 10 mm 



* Annealing temperature varied depending on primer set 



15 PCR products were visualised by agarose gel electrophoresis on a 1% agarose gel in TBE 
buffer. Discrete PCR products were cloned into pCR2. 1 TOPO using the Invitrogen TOPO 
TA cloning kit according to the manufacturers' specification. 

Each ligation contained 
20 ljJ PCR product 

1 ill pCR2.1 TOPO vector 
3 jil Sterile Water 



and was incubated at room temperature for 5 minutes. 2 \il of each ligation mix was 
25 transformed into TOPI 0 competent cells by heat shock at 42°C for 30 seconds followed by 
incubation on ice for 2 minutes. Transformed cells were allowed to express beta-lactamase 
by incubation at 37°C in SOC medium (2% tryptone, 0.5% yeast extract, lOmM NaCl, 
2.5mM KC1, lOmM MgCl 2 , lOmM MgS0 4 , 20mM glucose) for 1 hour with shaking at 225 
rpm. 



30 
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Cells were plated on Luria-Bertani Agar plates (1 .0% tryptone, 0.5% yeast extract, 1 .0% 
NaCl, 15g/L agar, 0.006% X-gal, 0.1 5mM IPTG) containing SO^ig/ml kanamycin for plasmid 
transformant selection and to enable identification of those containing recombinant TOPO 
TA isolates. 

5 

Discrete white colonies were selected from different PCR TOPO TA reactions, grown 
overnight in 5 ml Luria-Bertani (1.0% tryptone, 0.5% yeast extract, 1.0% NaCl, in water, pH 
7.0) containing 50^g/ml kanamycin. 

10 Plasmid DNA was extracted from the cultures using the Wizard DNA purification kit 

(Promega), following manufacturers' specifications. DNA was eluted in 50\xl sterile water. 

Plasmid DNA was digested with EcoRI to confirm the presence and size of inserts. 

3 jLil Plasmid DNA 

1 \xl EcoRI (Kmrnd Biotech) 
15 1 p.1 1 0 x Restriction Buffer 6 (Rramel Biotech) 

5 fil Sterile water 

Digests were incubated at 37°C for 2 hours and the presence or absence and size of inserts 
determined by agarose gel electrophoresis. 

20 

Based on these analyses, recombinant plasmids were selected for sequencing on a Perkin 
Elmer ABI 377XL DNA sequencer with the ABI Prism dye terminator cycle sequencing 
ready reaction kit, according to the manufacturers' protocol. 

25 4 pmol primer Ml 3 Univ or Ml 3 R 
5 fil DNA 

Sterile water to 1 2 fil 

The coding sequence of the peptide of SEQ ID NO 1 was identifiable by translation of the 
nucleotide sequence into amino acid sequence in all possible reading frames and comparison 
30 of this sequence to the known amino acid sequence of the peptide. This analysis used the 
DNA Star sequence analysis software (SeqMan, EditSeq, Macaw, VectorNTI). 



0)50431 



-22- 



5'RACEPCR 

An anchor-ligation approach (Troutt, A.B., et al., Proc. Natl Acad. Set USA. 89, 9823- 
5 9825) was used to obtain the nucleotide sequence of the 5 * half of the 524445 gene. This 
entailed attachment of a specific anchor primer to the 5' end of the first strand cDNAs. Use 
of this sequence together with inKNA specific for the peptide of SEQ ID NO 1 
complementary 3' primers allowed for selective amplification of the 5' end of the 
corresponding coding cDNA. 

10 

i. Primer annealing 

Complementary oligonucleotides Anchor3 and Anchor3-attachment were annealed to each 
other in equimolar ratio's at three different final concentrations (InM, lOOnM, lOmM). 

15 Oligonucleotide mixtures were heated to 95°C and cooled slowly to 45°C for annealing. 

ii. Ligation of annealed primer to cDNA 

The attachment primer is complementary to the anchor primer, but contains a 3 ' extension of 
5 additional fully degenerate bases i.e. synthesised with A, G, C and T at each position. This 
20 degenerate 'tail' allows individual attachment primers to anneal to the 3' terminus of any 
cDNA molecule. An amido group at the 3* end of the primer blocks DNA synthesis. A 
phosphate group at the 5' end of the anchor primer allows ligation of this to the 3' end of the 
cDNA molecules to provide a specific recognition sequence for PCR amplification. 

25 Reactions for ligation of annealed anchor primers to first strand cDN A preparations 
contained: 

5 \xl reaction mix from first strand cDNA synthesis 



30 10 mM 



30 mM 



10 mM 



TrisHCl(pH8) 
MgCl 2 

Dithiothreitol 
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0,5 mM 



ATP 



ltd 

1 jil Water 

ini 



T4 DNA Iigase (4 U/jal) (Kramel Biotech) 



Annealed anchor primers (final concentrations of lOOmM, lOnM, ImM) 



5 



Reactions were cycled overnight as follows: 



25 °C 5min 



10 



Slow Ramp Rate 



4 °C 5 min 



15 Reactions were pooled, incubated at 95°C for 5 minutes and snap frozen in liquid nitrogen. 
After thawing on ice, excess primers were removed by purification through a Wizard PCR 
clean-up column (Promega) using the manufacturers' specifications. cDNAs were eluted in 
40 pi water. 

20 iii. RACE PCRs 

PCR reactions were set up using the anchor-linked cDNA as a template, specific forward 
primers based on this anchor sequence, and specific primers based on the gene sequence of 
the peptide of SEQ ID NO 1 identified previously by 3'RACE. 



PCR reactions were performed using Ready-To-Go PCR Beads (Amersham Pharmacia 
Biotech) as used previously for 3' RACE. Components added to the PCR beads were: 



25 



30 20 pmol 
20 pmol 



cDNA template with Anchor3 annealed to 3' end of first strand cDNA 
Forward primer 
Reverse primer 



w 
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Sterile water to total volume of 25 fxl. 
PCR cycle conditions were: 

95 °C 1 min 



10 



95 °C 1 min 
i 

58 °C* 1 min 
i 

72 °C lmin 



30 cycles 



72 °C 10 min 



15 



* Annealing temperature varied depending on primer set 



PCR products were visualised by agarose gel electrophoresis and TOPO cloned as described 
above. Plasmid DNA was extracted from clones carrying candidate recombinant plasmids by 
Wizard miniprep, EcoRI digested and sequenced, as performed previously for 3' RACE 
20 clones (described above). 



25 



II. cDNA Library 
Library Construction 

A cDNA library of the fungus Paecilorrryces farinosus was constructed using the lambda- 
ZAP cDNA synthesis and ZAP-cDNA Gigapack III Gold Cloning kit from Stratagene, 
according to the manufacturers' specifications unless stated. 



Double stranded cDNA was synthesised using 5 \ig the mRNA from the peptide of SEQ ID 
NO 1 (see above) as a template. This involved first and second strand cDNA synthesis, 
30 blunting of cDNA terminii, ligation of adapters, and digestion with specific restriction 
enzymes to produce appropriate 'sticky ends' for directional cloning. 
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A Sephacryl S-400 HR MicroSpin column (Amersham Pharmacia Biotech) was used to 
remove excess adapters rather than the size fractionation step suggested in the kit. 
The gel filtration medium provided in the kit (sepharose CL-2B) separates molecules on the 
5 basis of size with a cut-off of 400bp. As the mature insecticidal peptide is only 33 amino 
acids long, it is highly likely that the gene may be smaller than 400bp and would have been 
selected against using the sepharose filtration medium. 

cDNAs were ligated into the Uni-ZAP XR vector and packaged into phage. The library titre 
10 was 2.5 million clones, with an average insert size of 700bp ranging from 150bp to 2 Kb. 



Library screening 

A total of 500,000 plaques were plated on Luria-Bertani Agar plates according to the cDNA 
15 library manufacturers 1 specification. Duplicate lifts of the plaques were made onto 
nitrocellulose membrane (Hybond-N, Amersham Pharmacia Biotech). 

The membranes were prehybridised in Denhardts hybridisation solution (5x SSPE, 5x 
Denhardfs Reagent [50x Denhardt's Reagent: 5g Ficoll, 5g polyvinylpyrrolidone, 5g bovine 
20 serum albumin, sterile water to 500ml], 0.5% SDS, sterile water to 1L) containing 

200^lsalmon sperm DNA (lOmg/ml) which had been denatured by boiling for 10 minutes, 
for 2 hours at 65°C. A radioactive probe was prepared by end labelling an oligonucleotide 
specific for the coding sequence of a peptide of SEQ ID NO 1 : 

25 25 ng Oligonucleotide (445-F1 1) 

1 p.1 Polynucleotide Kinase Buffer (Kramel Biotech) 

1 .5 |il T4 Polynucleotide Kinase (Kramel Biotech) 

5 jul gamma 32PdATP 

Sterile Water to 10 [il 



30 
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The probe was incubated at 37°C for 5 hours. The probe was added to 50ml Denhardts 
hybridisation solution and hybridised overnight at 65°C 

Membranes were washed in a O.lx SSC, 0.1% SDS solution for 4 x 15 minutes to remove 
5 unbound probe. Exposure to x-ray film identified positive plaques containing 524445 coding 
sequence. 

Positive plaques were cored from the original agar plates into 1ml SM buffer (5.8g NaCl, 2g 
MgS0 4 .7H 2 0, 50ml 1M Tris-Hcl pH 7.5, 5ml 2% gelatin, sterile water to 1L) containing 
10 20ml chloroform and vortexed. The phage DNA was allowed to enter the phage buffer by 
storage at 4°C overnight. Samples of phage were then diluted and re-plated to obtain 
approximately 200 plaques per plate. The plaque lift and hybridisation procedure above was 
repeated to identify positives. 

1 5 This process was followed for three rounds of screening until the plaques were pure. Self 
excision of 12 candidate positive plaques into colonies was performed as per Stratagene's 
specifications with the cDNA library kit 

Candidate colonies were grown overnight in 5ml Luria-Bertani medium containing 100fig 
20 /ml ampicillin, plasmid DNA extracted using Promega's Wizard miniprep kit, and the inserts 
sequenced using M13 Universal and Ml 3 Reverse primers (see above for details of all). 

Nucleotide Sequence 

25 The nucleotide sequence of the peptide of SEQ ID NO 1 is shown in Figure 2. The putative 
translation initiation codon and stop codon are shown in italics. The sequence which codes 
for the mature peptide is underlined. 

The sequence in Figure 2 indicates that there is approximately a 1 10 nucleotide 5' non- 
30 coding sequence and a 160 nucleotide 3' non-coding sequence. There seems to be a 17 
amino acid signal peptide 5' of the mature coding sequence. Potential signal sequence 
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cleavage sites were predicted based on the method of von Heijne, G. (1986). Nucleic Acids 
Research. 14, 4683. The potential cleavage site is indicated by a downward pointing arrow. 
It is probable that a secondary processing event removes the signal peptide from the mature 
peptide, e.g. by signal peptidase cleavage. 

5 

Example 7 

Expression of peptide in com 

European Corn Borer {Ostrinia nubilalis) and therefore corn, Zea mays, which has 
been transformed so as to express this peptide would be expected to be protected against this 
10 pest. 

Suitable constructs for expression in corn can be summarised as follows: 

Construct Promoter Signal Peptide Gene Terminator 

15 1 Maize Ubi SEQIDNO10 SEQ ID NO 9 + nos 

2 Maize Ubi SEQ ID NO 3 SEQ ID NO 9 + nos 

3 Maize Ubi SEQ ID NO 7 * SEQ ID NO T nos 

* This signal peptide contains an internal Ncol site which can be mutated ( for example 
20 CCATGG -> CTATGG) to destroy it if Ncol is required for cloning. 

+ The natural coding sequence can be modified in accordance with the degeneracy of the 
genetic code, and in particular for the purpose of codon optimisation in corn. 

25 The signal peptide can be fused to the mature gene for example using an overlapping 

PCR approach as illustrated in Figure 4. The fusion is suitably designed with restriction sites 
to allow cloning into monocot vectors. For example, it may comprise the following: 



5' Ncol-Kpnl Signal Gene Sad- Hindlll 3' 



30 
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The full length signal-gene fusion can be ligated between the maize ubiquitin promoter and 
nos terminator into a backbone vector containing PAT selection (phosphinothricin - basta 
herbicide resistance). 

These constructs can be used to transform corn cells which can then be grown into 
5 callus as is well known in the art. The transformed callus can be subjected to a corn callus 
transient assay and/or an in vivo bioassay to confirm expression and activity of the peptide. 

Example 8 

Expression of Peptide in Cotton 
10 The peptide of the invention has good activity against the Beet Armyworm 

{Spodoptera exigua) which is a major cotton pest. Thus cotton Gossypium hirsutum, which 
has been transformed to express this peptide would be protected against this pest. 

Suitable constructs for use in the transformation in this case can be summarised as: 

15 Construct Promoter Signal Peptide Gene Terminator 

4 RolDFd SEQ ED NO 13 SEQ ID NO 9 potato protease inhibitor II 

5 RolDFd SEQ ID NO 3 SEQ ID NO 9 potato protease inhibitor II 

6 RolDFd SEQ ID NO 7 SEQ ID NO 9 potato protease inhibitor II 

20 The signal peptide can be fused to the mature gene using an overlapping PCR 

approach as in Example 7. In this case, the fusion is suitably designed with restriction sites 
to allow cloning into dicot vectors. The full length signal-gene fusion can be ligated into a 
housekeeping vector between the RolDFd promoter and potato protease inhibitor II 
terminator. The entire cassette could then be cut out using restriction enzymes and ligated 
25 into an appropriate binary vector. 

Constructs can then be tested using conventional methods. 
Other modifications of the present invention will be apparent to those skilled in the 
art without departing from the scope of the invention. 
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Claims 

1 . A peptide obtainable from the genus of Paecilomyces, and having insecticidal 

5 properties, or a fragment thereof, or a homologue, variant or derivative of any of these which 
has insecticidal activity, 

2. A peptide according to claim 1 which is obtainable from the genus Paecilomyces. 

10 3 . A peptide according to claim 2 wherein the Paecilomyces spp. is Paecilomyces 
farinosus. 

4. An insecticidal peptide comprising the amino acid sequence 

15 GKICTPAGVKCPAALPCCPGLRC1GGVNNKVCR (SEQ ID NO 1) 

or a fragment thereof, or a homologue, variant or derivative of any of these which has 
insecticidal activity. 

20 5. An insecticidal peptide according to claim 4 of SEQ ID NO 1 . 

6. An insecticidal peptide according to claim 4 which gives a FASTA score in excess of 
1 09 when compared with SEQ ID NO I. 

25 7. An insecticidal peptide according to claim 6 wherein the FASTA score is in excess of 
199. 

8. An insecticidal peptide according to claim 4 which is of the sequence 
-AA 1 -AA r AA 3 -Cys-AA 4 -AA 5 -AA 6 -AA 7 -A^ 
30 Cys-AA 15 -AA 16 -AA ir AA 18 -Cys-AA,^ 

or a fragment thereof, wherein AAj -AA 27 refer to any amino acid other than cysteine. 
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9. An insecticidal peptide according to any one of the preceding claims which is fused 
to another peptide or protein, 

5 10. A nucleotide sequence which encodes an insecticidal peptide according to any one of 
the preceding claims. 

11. A nucleotide sequence according to claim 1 0 which is of SEQ ID NO 9 as shown in 
Figure 3. 

10 

12. A vector comprising a nucleotide sequence according to claim 10 or claim 1 1. 

13. A cell transformed with a vector according to claim 12. 

15 14. A cell according to claim 1 3 which is a plant or microbial cell. 

15. A plant which has been transformed with a vector according to claim 10 or claim 1 1. 

1 6. A method of killing or controlling insect pests which method comprises 

20 administering to the pest or to the environment thereof, a peptide according to any one of 
claims 1 to 9. 

1 7. A method according to claim 16 wherein the peptide is expressed by an organism 
found in the environment of the pest 

25 

18. A method according to claim 17 where the organism is a plant. 

19. A signal peptide comprising a peptide of SEQ ID NO 8. 

30 20. A nucleotide sequence which encodes the signal peptide according to claim 1 9. 



PJ50431 



31 



21. A nucleotide sequence according to claim 20 which comprises SEQ ID NO 7. 
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Figure 1 



Mature Peptide Coding Sequence 




(438 bp) 

Figure 2 
(SEQ ID NO 2) 

1 ATTACCCAAG TTTGAGGGCA TTCAATTTCA CACAGTCTCA CGCTTTCGAC 
51 GCATCTACTT CTTCGTCTCA CGCCATATAT CCTCCCAAAA TCACACCTCT 

+2 M*t GlnIl«S*r AlaValll* ValAloLauPhe AlaSerAl 

101 TCCTTCACCA TGCAAATCTC CGCCGTCATT GTCGCACTCT TCGCCAGCGC 

+2 aAlaMe)tAla ^GlyLysIleCys ThrProAla GlyValLys CysProAlaA 
151 CGCCATGGCC GGCAAGATCT GCACTCCTGC TGGAGTTAAA TGTCCCGCGG 

+2 laLeuProCys CysProGly LeuArgCysIle GlyGlyVal AsnAsnLys 
201 CTCTTCCTTG CTGCCCCGGA CTTCGCTGCA TCGGCGGCGT CAACAACAAG 

+2 ValCysArg*** 

251 GTTTGCCGGT AATTCTAGTG TCGCAACTTT TGAGCGTGGG ATAAGTATGC 

301 TTCGTTCGTT GTATGGAGTT CTCCTCCGGA GTTTAAGCTC GGCCGGTCGA 

351 CAGCGGGTCT GCTATACTTG ATCTTACAGC GATACTATTG ATAGAAATGC 



401 ACATCTTCAT TCATGCGTCA TGAAAAAAAA AAAAAAAA 



Signal peptide sequence indicated in bold type 
Mature peptide sequence underlined 
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Figure 3 
(SEQIDNO 9) 

GGCAAGATCT GCACTCCTGC TGGAGTTAAA TGTCCCGCGG CTCTTCCTTG 
CTGCCCCGGA CTTCGCTGCA TCGGCGGCGT CAACAACAAG GTTTGCCGGT AA 



Figure 4 

A 

Signal-Gene Fusion 

PEPTIDE GENE 

> 



Ncol - Kpnby SIGNAL PEPTIDE ^ \s*cl - Hindlll 




iritron end 



